Programming Massively Parallel Processors: A Hands-on Approach: The Hardware Sorting Dilemma

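In high-performance hardware, speed is everything. Picture a GPU performing Z-buffering: it must compare millions of depth values every second to decide which pixel lies in front. To hit that target, engineers rely on an unsigned magnitude comparator, a simple circuit that scans bit by bit from the leftmost bit (MSB) to the rightmost bit (LSB) and needs no special-case logic.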

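Standard two's complement fails this "dumb hardware" test. Because the sign bit is 1 for negative numbers and 0 for positive ones, a value such as -1 (111...1) looks larger at the bit level than +1 (000...1). This creates a discontinuity at the sign boundary, forcing the hardware to use more complex, slower conditional logic to decide which value is greater.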
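The discontinuity is easy to see in a small model. The sketch below assumes a 3-bit word (chosen only to keep the table short) and uses a hypothetical helper, `twos_complement_pattern`, to list the values in the order an unsigned comparator would rank them.

```python
BITS = 3  # assumed word width, chosen only to keep the table small


def twos_complement_pattern(value: int, bits: int = BITS) -> int:
    """Return the two's complement bit pattern of `value` as an unsigned integer."""
    lowest, highest = -(1 << (bits - 1)), (1 << (bits - 1)) - 1
    if not lowest <= value <= highest:
        raise ValueError(f"{value} does not fit in {bits} bits")
    return value & ((1 << bits) - 1)  # masking keeps only the low `bits` bits


values = list(range(-4, 4))

# Rank the values the way an unsigned comparator would: by raw bit pattern.
by_pattern = sorted(values, key=twos_complement_pattern)

for v in by_pattern:
    print(f"{twos_complement_pattern(v):0{BITS}b} -> {v:+d}")
```

Ranked by raw pattern, the values come out as 0, 1, 2, 3, -4, -3, -2, -1: every negative value lands above every non-negative one.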

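To restore efficiency, we use excess encoding (also called biased representation). The range is shifted so that the smallest possible value maps to 000...0 and the largest maps to 111...1. Each bit pattern still identifies exactly one value, and the lexicographic order of the patterns matches the numerical order of the values exactly.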
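A minimal sketch of the idea, again assuming a 3-bit word. The bias of 4 is an assumption chosen so that -4 maps to 000 and +3 maps to 111; `excess_encode` and `excess_decode` are hypothetical helper names, not part of any library.

```python
BITS = 3
BIAS = 1 << (BITS - 1)  # assumed bias of 4, so -4 -> 000 and +3 -> 111


def excess_encode(value: int, bias: int = BIAS, bits: int = BITS) -> int:
    """Shift `value` by `bias` to get its unsigned bit pattern."""
    pattern = value + bias
    if not 0 <= pattern < (1 << bits):
        raise ValueError(f"{value} is out of range for excess-{bias} in {bits} bits")
    return pattern


def excess_decode(pattern: int, bias: int = BIAS) -> int:
    """Undo the shift to recover the represented value."""
    return pattern - bias


values = list(range(-BIAS, BIAS))  # -4 .. +3

# Ranking by raw pattern now reproduces the numerical order.
by_pattern = sorted(values, key=excess_encode)

for v in values:
    print(f"{v:+d} -> {excess_encode(v):0{BITS}b}")
```

Because encoding only adds a constant, comparing two patterns as unsigned integers gives the same answer as comparing the values they represent, so no sign-handling logic is needed.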

This property lets a "dumb" hardware comparator handle "smart" floating-point data directly.
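As an illustration, the sketch below assumes IEEE 754 single precision, which the text does not name explicitly: an 8-bit biased exponent sits directly above a 23-bit fraction. For the positive sample values chosen here, comparing the raw 32-bit patterns as unsigned integers ranks them the same way as comparing the floats themselves. The helper names are hypothetical; the sign bit is left out of the picture by using only positive inputs.

```python
import struct


def float32_bits(x: float) -> int:
    """Reinterpret `x`, rounded to single precision, as a 32-bit unsigned integer."""
    return struct.unpack(">I", struct.pack(">f", x))[0]


def exponent_field(x: float) -> int:
    """Extract the 8-bit biased exponent field (bits 30..23)."""
    return (float32_bits(x) >> 23) & 0xFF


# Positive samples in increasing numerical order (assumption: no negatives, no NaN).
samples = [0.125, 0.5, 1.0, 1.5, 2.0, 1024.0]
patterns = [float32_bits(x) for x in samples]

# An unsigned comparison of the raw patterns preserves the numerical order.
print(patterns == sorted(patterns))

for x in samples:
    print(f"{x:>8} exponent field = {exponent_field(x):08b}")
```

The exponent field grows with the magnitude of the value, which is what allows a plain unsigned comparison to rank these patterns correctly.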


QUESTION 1

What is the primary architectural advantage of using Excess Encoding for exponents?

It allows the use of simple, fast unsigned comparators.

It increases the total range of representable numbers.

It eliminates the need for a sign bit in the mantissa.

It prevents overflow in integer addition.

QUESTION 2

In a 3-bit Two's Complement system, which bit pattern appears 'larger' to an unsigned comparator: -1 or 0?

-1 (pattern 111)

0 (pattern 000)

They appear equal.

Neither, the comparator crashes.

QUESTION 3

What property is defined by 'as the bit pattern increases, the represented decimal value also increases'?

Associativity

Monotonicity

Normalized Representation

Truncation

QUESTION 4

Why is the sign-bit discontinuity a problem for GPUs?

It forces sorting to happen on the CPU instead.

It necessitates complex, multi-stage conditional logic for every comparison.

It makes depth values (Z-buffer) impossible to store.

It causes bit-flip errors in the LSB.

QUESTION 5

If we use Excess-127 for an 8-bit exponent, what bit pattern represents the value 0?

00000000

01111111

10000000

11111111